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Abstract. We show that all multivariate Extreme Value distributions, which 
' ~H are the possible weak limits of the K largest order statistics of iid samples, have 

, ^ the same copula, the so called K-extremal copula. This copula is described 

through exact expressions for its density and distribution functions. We also 
study measures of dependence, we obtain a weak convergence result and we 
CN| propose a simulation algorithm for the K-extremal copula. 

1. Introduction 

In the study of extremes of iid sequences a question of interest is whether or 
not the dependence relation among the marginals of the limit distribution of the 
K largest order statistics relies on the parent distribution function of the sequence. 
One way to evaluate nonlinear dependence between random variables is through the 
copula associated to them, this is already discussed in several books as the ones by 
J> Joe [7], Nelsen [lOj and Drouet-Mari and Kotz 5J. In the present paper, we show 

that every multivariate extreme value distribution, which are the possible weak 
limits of the K largest order statistics of iid samples, have the same copula called 
the if-extremal copula. From the Extremal Types Theorem, see below, extremal 
distributions are obtained from linear transformations of one of three basic distribu- 
tions. We prove that the copula for the three basic types is the iiT-extremal copula, 
thus all K-dimensional multivariate extremal distribution have the same nonlinear 
dependence among its marginals. This is not remarkable since the copula for any 
group of order statistics of an iid sample of size n with continous parent distribu- 
tion do not denpend on this distribution, see Lemma 6 in [T]. However, a proper 
caracterization of the K-extremal copula is relevant as well as their consequences. 



(N 



> 

X 

Jh Our result generalizes the case K = 2 which was considered in [9]. 

The X-extremal copula is described by its distribution and density functions 
through exact expressions. We show that the copula of the K largest order statistics 
of iid sequences with continuous parent distribution converges in distribution to the 
if-extremal copula. We also study the assymptotic behavior of Spearman's rho and 
Kendall's tau for the first and the K largest order statistics. As a last result, we 
propose a simulation algorithm to sample from the A'-extremal copula. 

In section [2] we will present and discuss the results in this paper postponing all 
the proofs to section [3] 
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2. Statements 



Fix an interger K >2. For every n> K, let Mi^„, ... , MK,n be the K largest 
order statistics of an iid sample of size n with parent distribution not depending 
on n. The Extremal Types Theorem, see sections 2.2 and 2.3 in [8 and section 4.2 
in [6], states that if for some sequences of real numbers (a„)^]^ and (6„)5^i the 
random variables a„Mi „ + &„ converge in distribution then the random vectors 

(an Ml anMK.n + hn) (2.1) 

also converge in distribution. The limit belongs to a family of distributions parametrized 
by — oo < /i < oo, CT > and — oo < ^ < oo. For a choice (/i,cr, ^) of the param- 
eters, the marginals of a limit distribution have distribution function and density 
functions given respectively by 

exp{-A(.2)} ^"J-Q^ ^ , if e (^) > -1 for e ^ or z e M for e = 
G^(z) = { ' , if < Ai- f for e > 



if z > - I for ^ < 0. 



(2.2) 



and 



^(^) ^ I exp{-A(z)}^^(^M£ril , if ? (^) > -1 for e ^ or z G M for e - 
1 , otherwise, 

(2.3) 



where 



A(2) = A5^p^^(z) = 




if^^O 
if e = 0, 



for m < 1. A distribution with distribution function as above is called a General- 
ized Extreme Value (GEV) distribution which are classified in types I, II and III 
according respectively to ^ = 0, ^ > and ^ < 0. Note that the function A is 
strictly decreasing positive function and satisfies 

lim A(z) = +00 and lim A(z) =0, if ^ = 

lim A(z) = +00 and lim A(z) =0, if ^ > (2.4) 

lim A(z) = +0O and lim A(z) = 0, if ^ < 0. 

Also by the Extremal Types Theorem, the joint density function of a limiting 
extreme value distribution for normalized sums of the K largest order statistics of 
an iid sequence, as in (2.1 1, is given by 

ciKizu ...,zk) = [ ^"^^"^ cxp{-A(zK)} nf=i A'(z,) , if {zi, ...,zk) € 
' ' 1 , otherwise. 



(2.5) 



where 



, ife = o 

n^=\ {(zi,...,zk) e : zi > ... > > A*- f} , if ^ > 

> zi > ... > zk} , if ^ < 0. 
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A distribution with density as in (2.5) for parameters —oo < /i < oo, cr > 
and — oo < ^ < cx) is called a Multivariate Generalized Extreme Value (MGEV) 
distribution. 

Remark 2.1. A broader class of stationary sequences of random variables have a 
MGEV distribution as the assymptotic distribution of the largest maxima. These 
sequences should satisfy some weak dependence condition. The results can be found 
for instance in [B] . 

Our first result gives an explicity expression for the distribution function associ- 
ated to the density gx- 

Proposition 2.1. The distribution function Gk of a limiting extreme value dis- 
tribution for a normalized vector of the K largest order statistics of iid continuous 
random variables has the following representation 

Gk{zi,-; zk) = min(zi, Z2), min(zi, Z2, 2:3), min(zi, zk)) , 

for every (zi, ...,zk) G M.^ , where 

Hk(zi, Zk) = exp{-A(zK)} J/f (A(zi), K{zk)) 
/or min(zi, ...,z_R-) > f; */C > 0; or /or min(zi, z/f) < ^ - |, if ^ < 0, or 
(zi, Zk) G R-^ , if £^ = 0, otherwise Hk{zi, zk) = 0. The function Jk ■ IR+ ^■ 
is a polynomial in K variables which is defined by induction by putting Ji = 1 

and 

m—l j m—1 

Jmixi,...,Xm) = X! ^ ~ X! -^Jm-jiXj + l,...,Xm), form > 1. 
j=0 ^' 3=1 ^' 

We can now compute the density of the copula associated to the density gK of 
a MGEV distribution, which we call the K-extremal copula and tuns out to not 
depend on the parameters ^, ji and cr. 



Proposition 2.2. The density of the copula of a MGEV distribution is given by 
ck{ui,...,uk) = I Yl ^^^^^(wj) I 'j-^{uk) (2.6) 



for {ui, ...,uk) G (0,1)^ such that ui > V-'2(u2) > ■•• > i^Kiux), where ipm ■ 
(0, 1) — ^ (0, 1) is the increasing function that satisfies the following implicit equation 

I ( ^'v^^ (log?/'m(u))-' 
M = ^ , (2.8) 



otherwise cx('Ui, uk) ~ 0. 



Remark 2.2. The function t\)m which appears in the expression for the density 
of the K-extremal copula can be obtained from a MGEV distribution function as 
i^miu) — exp{— A(G'~^(u)} for every u e (0, 1) and m > 1. 
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Also with the distribution function of the MGEV distribution, it is straightfor- 
ward to write the distribution function of the K-extremal copula which we present 
in the next result. 

Proposition 2.3. The copula of a MGEV is given by 

Ck{ui, ....uk) = HKiui,ri{ui,U2),r2{ui,U2,U3), ...,rK-i{ui, ...,uk)) ■ 
for every {ui, ...,uk) G [0,1]^, where 

/ N i-ln i \\ It \V^\ (logV'!(M/)V 
r,„_i(ui,...,M,„) \Wi\ui)) =tpi[ui) 2_^(-^y ^ , 

3=0 ^' 

if ipiiui) = min('0i (ui ),..., V'm(Mm)) and for every {ui,...,uk) such that ui = 
V'i(ui) > ■i/'2(w2) > ••■ > ^k{uk) 

Hxiui, ....uk) = iPk{uk)Jk log ui, - log 7/12(^2), ...,-\ogipK{uK)) , 

= UK-TpKiuK) ^ :j Jx_j(-l0gV'i + l(Wj + i),...,-l0gV'if(uK)) 



with Jm defined in the statement of Proposition 2. 1 



By a simple generalization of Lemma 6 in [Tj, we have that the multivariate 
copula of the K largest order statistics of an iid sample of size n do not depend on 
the continuous parent distribution of the sample. This copula will be denoted by 
cj^"*, where n denotes the size of the sample. The next proposition is a convergence 
result for copulas that has the consequence that for continuous distributions the 
non-linear dependence structure of the K-largest order statistics of large iid samples 
is approximatedly captured by the K-extremal copula. 

~ in) 

Proposition 2.4. The copula Cj^ converges in distribution to Ck as n 00. 

From the X-extremal copula we can obtain the copula between the 1 largest and 
the m largest limiting order statistics for every choice of I and m, or between any 
two marginals of a MGEV distribution. Then we can use these bivariate copulas 
to obtain measures of dependence as the Spearman's rho and Kendall's tau. For a 
copula C, the Spearman's rho is defined by 

12/ ( C{u,v)dudv-3^12 [ [ uvdC{u,v)-3 
Jq Jo Jo Jo 

and Kendall's tau by 

4 [ [ C{u,v)dC{u,v) - 1. 
Jo Jo 

We are going to study here the behavior of Spearman's rho and Kendall's tau for the 
first and the i^th marginals of the isT-extremal copula in the limit as 7^ 00. We 
denote these measures respectively by px and tk, K > 2. Using the convergence 
result in proposition 2.4, this caracterizes the behavior of these measures for the 
first and the Kth largest order statistics of large samples with continuous parent 
distribution. We point out that p2 = 2/3 and T2 = 1/2 have been obtained [9]. For 
more on measures of dependence of order statistics see and [TT]. We have the 
following result. 
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Proposition 2.5. Both sequences [pk) and {tk) converges to zero as K ^ oo. 

We now describe a simulation algorithm to generate samples from the K-extremal 
copula. The method is based on a technique of conditional sampling to sample from 
multivariate copulas, see for instance Cherubini, Luciano and Vecchiato's book [3]. 
We can resume the procedure with the following steps: 

(i) Put Ci{ui,U2, ■■■,Um) = C(ui,it2, 1, 1) for m = 2, K; 

(ii) Sample ui from the uniform distribution in (0, 1); 

(iii) Sample Um from the conditional distribution Cm{'\ui, ...yUm^i) for m = 
1,...,K; 

We now are going to focus on how to sample Uk from the conditional distribu- 
tion Cfe(-|ui, To sample Um from Cm(.|wi, Wm-i), we sample q from 
t/(0, 1) and we put Um = C^^{q\ui, Um-i)- Therefore we should know explicitly 
Cm{'\ui, We compute it in the following lemma: 

Lemma 2.6. The condicional distribution function of U„i\{Ui,U2, ■■■,Um-i) when 
(Ui, Uk) has distribution given by the K-extremal copula is given by 

Cm{Um\Ui, ...,U,n-l) = 7 T- (2.9) 

Vm-l{Um-l) 

If we now put q — C„i(u„i|ui, Wm-i); we have that: 

Um = C„^(g|-Ui, = l/'m^(5'-^m-l(Um-l)) • 

From definition |2.8| we get 

I t I I -i^j (l0gV'm(g-V'm-l(Um-l)))^ 
Um=Wm\q-i>m~\\Urn~\)) 2^\-^Y ^ • 

Therefore, we solve numerically and then -i/;,„(g.i/'m_i(um-i)) to ob- 

tain Um- 

We plot below a sample of size 200 from the 4-extremal copula. 
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3. Proofs 



Proof of Proposition 2.1: We show that Gk is a X-dimensional distribution 



function with density given by gx- By the definition of qk, the multiple integral 

■■■ / gKivi, ■■■Vk) dyi...dyK 

-OO 'J —oo 

is equal to 

/zi /'min(zi^Z2) nmin{zi ,....zk) 
/ ••■ / gKiyi,--yK) dyi-.dyx. 

-OO ^ — OO J —OO 

Therefore Gk{zi, zk) = Gi<-(zi, min(zi, 22), niin(zi, z^^ )). From now on, 



we suppose that Zi > Z2 > ... > z^- Then, from (2.5), 

Gk{zi,...,zk) = {~1)'' / - / / exp{-A{yK)}l[A'{yj)dyi...dyK, 

J JyK Jys •'1)2 j — i 

where = /i — if ^ > 0, and A^^ — ~oo otherwise. Considering the following 
change of variables in the last integral, Xj — A{yj), for 1 < j < K, we get the 
following integral 

^ + 00 pXK rXs pX2 

Ik{wi,...,wk):^ i-l)'' / - / / e-"^dxi...dxx, 

where Wj = A(zj). To complete the proof. We show by induction that 
Ik{wi, ■■■,wk) = e^^'^Jxiwi, ...,wk) ■ 

For K — 1, a simple verification shows that the result holds. Now suppose that it 
holds for 1 < if < L — 1. For K = L, we perform the first iterated integral in the 
expression for Ik{wi, wk) to obtain that it is equal to 

I' + CO pXK pX3 

(-1)^/ / ... / X2e~'"^dx2...dxK - WiIk-i{w2, ■■■,wk) ■ 

Then perform the first iterated integral in the first term of the previous expression 
to obtain 

p + CO pXk fXi 

(-1)^ / / ... / ^e-^-dxs.-.dxK- 



-^Ik-2{w3, ■■■,wk) - wiIk-i{wi, ....wk) ■ 



Following recursively this procedure we get 



m— 1 •)■ m — 1 7 



Ik{wi,...,wk) = e ^ ^ - ^ -j-lK-jiwj+i,...,WK)- 
By the definition of Jk and the induction hypotheses we complete the proof. □ 



Proof of Proposition \2.2\ Let us fix a limiting extreme value distribution func- 
tion Gk- We have that 

/ X gK{G^^{ui),...,G]^^{uK)) 

ck{uu...,uk) = -j^ . 

n,=i5j(G,- (mj)) 
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Therefore we just apply formulas (2.3) and (2.5) to obtain that Ci<:(ui, Uif ) is 
equal to 

-1 



' A'-l 

n eM-HGjHu,))} 



U 1)! 



HG],\uk)) 
{K-l)l 



From this formula, if we put ipmiu) — exp{— A(G^j^(m)} we get (2.7) in the state- 
ment. Now (2.8 1 is a direct consequence of the explicit formulas for the distribution 
function G„i given in (2.2 1. 



It remains to verify (|2.6|). If we derive both sides of (|2.8|), we get that 



1 = 



m-2 

E 



(-1) 



du 



which implies that 



and 



du 

dlogtpn 

du 



„i(log^„,r-idV;„ 
(m — 1)! du 



(log^^)"- 
(m- 1)! 

(log Ipm 



m— 1 



(to - 1)! 



(3.1) 
(3.2) 



Proof of Proposition |2.3[ Let us fix a limiting extreme value distribution func- 
tion Gk- Then the distribution function of the K-extremal copula is given by 

Ck{ui,...,uk) = Gk{G^^{ui), ...,G]y{uK)) 

for every {ui, ...,uk) G [0, 1]^ which by Proposition 



From (3.1), (3.2) and (2.7) we arrive at (2.6). □ 



2.1 



is equal to 



HKiG^'iu^),mm{G^\u,),G^'iu2)),...,mmiG^'iu,),...,G],\uK))). 



By the definition of Hk, monotonicity and the expression for tjjm in remark [2. 2 [ see 
also the proof of Proposition |2.2[ the previous expression is equal to 



min {ipi{ui)) Jk 

KKK 



log Ui 



\ogmm{ipi{ui)), 



log min (ipiiui)) 

KKK 



Using the definition of in the statement, write the above expression as 
i^K{rK[ui,...,Um)) Jk (-logui, -log'(/'2(?'2(ui, "2)), ■.■,-\ogi'K{rK{ui, ...,u„0)) , 
which completes the proof. □ 



Proof of Proposition 2.4: Let Mi^„, ... , Mk^u be the K-largest order statistics 
of a sample of size n with a given continuous parent distribution function F which 
belongs to the domain of atraction of a GEV distribution. This means that there 
exists (flji)^^! and (6n)^^ sequences of real numbers such that the random vector 



(a„Afi 



..,a„M, 



K,i 



bn) 



converges in distribution to some Gk which is MGEV distribution. By invari- 
ance concerning composition with affine transformations the copula associated to 
(Mi_„, ...,MK,n) and (a„Afi,„ -|- &„, ...,a„M_R-,„ -|- 6„) is (7^^ independently of F. 
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Let Fj^n be the distribution function of UnMj^n + i>n- Therefore, if we define the 
function V^ixi, ...,xk) = (-Fi,„(a;i), Fif^„(x_R-)), {xi, ...,xk) G then 

F„(a„Mi,„ + 6„, a„MK,n + K) (3.3) 

" in) 

has the distribution of the copula Cj^ . 

The K-extremal copula has the distribution of V(Yi, Yk), where V{xi, xk) = 
{Gi{xi), ...,Gk{xk)), {xit--tXk) G M". By Theorem 5.1 in 0, ( |3.3[ ) converges in 
distribution to the K-extremal copula if Vn converges uniformly to V on compact 
intervals, but this is a consequence of Polyas's Theorem which implies that Fj,n 
converges uniformly to Gj since the last is absolutely continuous. □ 



Proof of Proposition 2.5 ; We shall prove through estimates on exact expressions 
that pk — > 0. The analogous result can be applied to tk since pk>tk> 0. This 
last assertion can be verified through Theorem 5.1 of Fredricks and Nelsen in [4]. 
Indeed, according to their terminology, for two order statistics, the largest is always 
left-tail decreasing and smallest is right-tail increasing. 

Applying directly the definition we can write {pK -f 3)/12 as 

ui UK ck{ui, uk) dui...duK- (3.4) 

which we are going to show that converges to 1/4 as A" — > oo resulting in pK — > 0. 
By (2.6) the previous iterated integral can be rewritten as 



ui Uk 



n ^(«^) 



K 



dui 



duK 



{uk) dui...du 



K- 



By induction inl<m<^~l, we show that 



/" 

Jib', 



is equal to 



— yuj) dui...du„ 

l/'2 '(-^sCms)) "''/'2(M2) j^l 



duj 



'0m+l(Wm+l) 



3=0 ■' ■ 



(3.5) 



Indeed, ipi is the identity function in (0, 1) and therefore 



Ul'^^°^^("l)c?"l = (-1)[V'2(W2) " 1] 
i/'2(«2) ""1 



Now suppose that (3.5 1 holds for some 1 < I < K — 2 then 

i-i 



(-1) 



ipi+iiui+i) - 



(log iiji+i{ui+i)y 



J 



d log -0, 
dui-i 



(ui+i) . 



is equal to 



(-1)' 



du 



i+i 



If \ (logV'i+i(w;+i))-' 



J 



and, since ipi+ii^) = 1; integrating on over the interval (tpi^-^^(tpi^2iui+2)) , 1) 
we obtain that (3.5) holds for m = I + 1. 
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Therefore the mtegral in (3.4) is equal to 



du 



-{u){-l 



,A'-1 



K-2 



(log i)K{u)y 



du . 



Put V = ipxiu), u e (0, 1) and uses the power series expansion 

~^ log(w)^ 



J=0 



to write the previous integral as 



»i / oo 



iog{vy 



dv . 



Another change of variables and ( |2.8p allows us to write the integral in (3.4) as 



3 /-+00 



~^ydy 



1=0 j=K~l 



which, since 







can be rewritten as 

K-l oo 

We finish the proof showing that 

K-l oo 

-l+3\ 1 



K- 



l J 2'+J 



{ (-1)"-^ E E (-1) 

From this point we suppose that K is odd, for K even the proof is similar with 
few sign changes. The left hand side term in the previous convergence statement is 
equal to 

K-l oo , . 1 K-l 

l+3\ 1 



E E i'V) 

1=0 j=K-l 

Now apply the identities 



21+] E E ( I 



l + 2j + l\ 1 



2/+2J 



(3.6) 



( )=i / ) 

to write the second term in (3.6) as 



l + 2j\ , fl + 2j 
I - 1 



for I > 1, 



E 



K 



1)22^+1 E E ( / ) 



Therefore (3.6) is equal to 



/=o j=K-l 



^ + 1 



2'+J 
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which is 



oo 



2j + l 

^ \K -IJ 2i ' ^ \K-\ 

j=2K j=K-l 



1 - 



2j 



2{2j - K + S)J 22J+2 



1 



Let y be a random variable with negative binomial distribution with parameters 
K and 1/2. Then the second term in the smn above is equal to 

Y 



E 



1 



2(y - A' + 1) 



1{Y even, Y > 2K} 



which is bounded above by 

Y 



E 



1 - 



< 1 



2{Y -K + 1)J 

2 + K-i 



] 1{2K <Y > 2K + Ki} 



-V{Y > 2K + K-\ 



Y -E[Y] ^ K-4 



2 + 2K-i +2K-^) J \ V2K 
that goes to zero as X — > oo by the central limit theorem. 



V2 



Therefore the hmit of (3.6) as — )■ oo is the same as the hmit of 



oo 



J - 1 \ 1 



^ \K-lJ2i 

j=2K 

which is the probability that a negative binomial distribution with parameters K 
and 1/2 takes a value greater or equal to 2K. This probability converges to 1/2 
again by the Central Limit Theorem. □ 



Proof of Lemma 2.6: Let {Ui,U2, ■■■,Uk) be a random vector whose distribu- 
tion function is C. The conditional distribution of Um given f/i, C/2, t/m-i has 
distribution function 

CmiUm\ui, ...,Um-l)^'P{Um < UrnPl Ul, Um-1 = Mm-l) 



( 



d'^-'^Cr^{ui,...,U„^) 

dui,...,dum-i 



y dui,...,dum-i J 



(3.7) 



for every m — 2, k. 

We first deal with the numerator in (3.7) which by the formula in Proposition 
12.31 can be written as 

-^miUm) Z]j="l^ -iog(^, )) J^_^_.(^^ log tAj^i . . . , - log i/"™ (""m) ) 



(9" 



dui...du,n^i 

If we remove the terms that do not depend on all the variables wi, m,„_i, we 
obtain that the last partial derivative is equal to 



<9" 



-IpniiUm) 117=1 {~log{lp{Uj))) 



Using that 



dlog ip„ 
du 



9mi...5u„i_i 



(3.8) 



(logTA^)"- !^ ' 
(m- 1)! 
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we obtain that (3.8) is equal to 

m— 1 



(3.9) 



Now we consider the denominator in (3.7) which is equal to the density function of 



the (m-l)-extremal copula. Hence it is equal to 

-1 



'm-2 



(j 1) 



(logV>„,_i(u^_i))"'-^ 

(771-2)! 



(3.10) 



Finally replace the expressions in (3.9) and (3.10) respectively in the numerator 



and denominator in ( |3.7[ ) to obtain that 

Cm {Um . . . , Wm- 1 ) 



■(/'m-l(Um-l) 



□ 
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